Automated virtual appliance sizing

ABSTRACT

Methods and apparatus, including computer program products, are provided for sizing the operating system of a virtual machine. In one aspect, there is provided a computer-implemented method. The computer-implemented method includes receiving information representative of an extended application layer and a container, both of which operate at a virtual machine at a host. A determination is made regarding whether one or more aspects of the extended application and the container may be deinstalled. The extended application and the container are implemented without the deinstalled aspects. Related apparatus, systems, methods, and articles are also described.

FIELD

The present disclosure generally relates to virtual machines and, in particular, to sizing aspects associated with a virtual machine.

BACKGROUND

Computers have now become an integral part of our society both in business and in residential settings. Almost every business of sufficient size in Germany, the United States, and other developed countries has one or more computers to assist them in running their businesses. Similarly, many families in those countries now have computers at home that are used to run various applications including games.

Some attribute the popularity of computers to the Internet. The Internet provides people with ready access to vast amounts of data. Many people now get their news, sports, stock, entertainment, and other information primarily from the Internet. Businesses have also embraced the Internet. The Internet provides the opportunity for computers to communicate instantly with other computers or individuals. Business processes that were once restricted to intranets and their users are now moving to the Internet. Accordingly, companies are moving more and more of their data to electronic forms. In addition, companies have amassed huge amounts of data in an effort to understand their business, improve performance, and build stronger employee, customer, and partner relationships.

Virtualization is a technology for optimizing processing at a computer. Virtualization provides a software layer that when executed allows multiple virtual machines with, in some cases, different operating systems to run side-by-side with other virtual machines running on the same host (e.g., a physical machine, such as a node, a computer, a processor, a server, a blade, and the like). A virtual machine is a machine that is defined and implemented in software rather than hardware. The virtualization software provides a so-called “wrapper” that wraps and isolates the virtual machine from other virtual machines. For example, in a server complex including fifty physical servers, each of which hosts its own application server, virtualization permits the server complex to instead operate with, for example, twenty-five physical servers, each of which includes virtualization software providing two virtual machines to operate as application servers. In both cases, fifty application servers are deployed, but with virtualization, the number of physical servers is reduced to twenty-five. Virtualization software may also control one or more of the following functions: running multiple virtual machines with different operating systems at the same time on the same physical machine; generating fully configured isolated virtual machines with a set of virtual hardware including an operating system and applications; saving, copying, and provisioning of virtual machines; and moving virtual machines from one physical machine to another physical machine for workload management.

An example of virtualization software is a hypervisor (also referred to as a virtual machine controller or, more simply, a controller). VMware ESX and VMware Server are examples of hypervisor software for virtualizing an operating environment (including operating system, IP (Internet Protocol) addresses, registries, and other aspects normally found in a computer system) to provide a virtual machine. The hypervisor software may control (or manage) the physical machine's processor, memory, and other resources enabling the virtual operating environments (i.e., systems).

The virtual machine may include an operating system. The operating system may include one or more applications. An operating system (OS) is the program that, after being initially loaded into the computer by a boot program, manages other programs on the computer. The other programs are called applications (also referred to as application programs). The application may make use of the operating system by making requests for services through one or more application program interfaces (API). An application may perform one or more specific functions (or tasks) directly for a user or, in some cases, another program or application program. Examples of applications include spreadsheets, word processing, browsers, and the like. For example, a virtual machine may include an operating system, such as Linux or Windows Vista, and one or more application programs, such as a browser, all of which operate in the so-called “container” provided by the virtual machine. In some cases, the virtual machine may also include some data for use by the application.

The phrase “virtual appliance” refers to an example of a virtual machine that may include the application, operating system, and other items (e.g., data, drivers, etc.) to enable simplification of the installation and the configuration process associated with running the application. An example of a virtual appliance is the MediaWiki software that powers Wikipedia, which is available as a virtual appliance. The MediaWiki appliance contains all the necessary software, including operating system, database, and MediaWiki, to run a wiki installation as a so-called “black box.”

SUMMARY

The subject matter disclosed herein provides methods and apparatus, including computer program products, for sizing aspects associated with a virtual machine.

In one aspect, there is provided a computer-implemented method. The computer-implemented method includes receiving information representative of an extended application layer and a container, both of which operate at a virtual machine at a host. A determination is made regarding whether one or more aspects of the extended application and the container may be deinstalled. The extended application and the container are implemented without the deinstalled aspects.

The subject matter described herein may be implemented to realize the advantage of reducing the size of virtual machine implementations.

It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive. Further features and/or variations may be provided in addition to those set forth herein. For example, the implementations described herein may be directed to various combinations and subcombinations of the disclosed features and/or combinations and subcombinations of several further features disclosed below in the detailed description.

BRIEF DESCRIPTION OF THE DRAWINGS

In the drawings,

FIG. 1 depicts a block diagram of a system 100 including virtual machines;

FIG. 2 depicts a process 200 for sizing;

FIG. 3 depicts another process 300 for sizing; and

FIG. 4 depicts another process 400 for sizing.

Like labels are used to refer to the same or similar items in the drawings.

DETAILED DESCRIPTION

Virtual machines are used to run applications in a flexible way. For example, virtual machines simplify adding resources, such as processing power, network capacity, memory capacity, and/or storage capacity, since such resources can be added virtually without necessarily requiring the addition of physical resources. Virtual machines also simplify moving applications from one physical machine to another or isolating applications from other applications. The uses of virtual machines will more than likely increase in the future.

To illustrate uses of virtual machines, virtual machines may be used in home banking to isolate applications. For example, most consumers fear that a computer virus may be used in an unwanted manner (e.g., to scan a password or illegally manipulate a banking transaction). A virtual machine may be used to minimize that risk by building a virtual machine that is isolated, i.e., used only for home banking. No other programs are installed or allowed to run on that virtual machine and no Internet browsing is done from this virtual machine save for browsing of the banking web site. In this configuration, the virtual machine provides an isolated home banking application that is less vulnerable to attack from, for example, a virus. In another example implementation, a virtual machine may be used in an Enterprise Service Oriented Architecture (ESOA). In such an architecture, each service in the ESOA may run in its own virtual machine, providing isolation between different services and providing scalability of the services (e.g., varying virtual machine resources, such as processing power, memory, storage, bandwidth, etc.). Although the above describes two example uses of virtual machines, other uses may be implemented as well.

Although virtual machines will increase in use, a possible impediment to their use is that virtual machines are installed with a full technology stack including a physical host, an operating system, a container, an application, services, and/or business processes. The services are typically applications with specific capabilities, such as security or remote usability, and business processes are a combination of services. The phrase “extended application layer” is used to refer to one or more of the following: an application, a service, and a business process. The use of a complete technology stack causes unnecessary overhead in terms of increased main memory consumption, increased disk storage, and so forth. The subject matter described herein relates to sizing the technology stack of an application operating in a virtual machine to reduce unnecessary aspects of that technology stack. In some implementations, the sizing is based entirely on the optimization of the applications (and higher levels like services and business processes) and the container infrastructure (e.g., a J2EE container, an ABAP container, a .NET container, and the like). In the case of complex, large enterprise software systems, there is much more to gain with respect to sizing the applications and the container infrastructure layer than merely sizing aspects of the operating system layer.

FIG. 1 depicts a system 100 including one or more hosts 116 a-116 b, a configurator 140, and storage 145, all of which may be coupled by a network 150 or any other communication mechanism including the Internet, intranet, inter-process communications, and the like.

Hosts 116 a-b may be implemented as physical machines (e.g., processors, computers, servers, blades, and the like) capable of hosting one or more virtual machines 114 a-b. The virtual machines 114 a-b, implemented on hosts 116 a-b, may be implemented as software for creating a virtualization layer between the host and the host's operating system. In some implementations, the virtual machines 114 a-b may be implemented as a virtual appliance, which refers to a virtual machine that includes an application, an operating system and, in some cases, data to configure and/or operate the application. Moreover, the virtual machines 114 a-b may each run an application 112 a-c (e.g., a software program, an application program, a component of an application program, an applet, or the like). In some implementations, each of the applications 112 a-c may have an operating system 118 a-b, such as Microsoft Windows, Microsoft Vista, Java, Sun OS, Linux, or any other operating system. The applications may also operate within containers 122 a-c, such as a J2EE container, an ABAP container, a .NET container, or the like.

Agents 120 a-b may be used to receive information representative of the container infrastructure or the extended application layer installed on the host. The phrase “container infrastructure” refers to one or more containers, such as containers 122 a-c. For example, agent 120 a may be implemented to run in (e.g., as part of) the container infrastructure and receive information representative of the resources being used by application 112 a while running in the container.

In some implementations, agent 120 a may be incorporated into the container infrastructure or the extended application layer, although agent 120 a may be implemented in other locations as well. In some implementations, agents 120 a-b may be implemented as a web service. A service, such as a web service, is an application that may make itself available over the Internet or an intranet, may use standardized messaging, such as XML (eXtensible Markup Language) and Simple Object Access Protocol (SOAP), and may use some type of location mechanism, such as UDDI (Universal Description, Discovery, and Integration), to locate the service and its public Application Program Interface (API). In any case, agents 120 a-b may receive information and then forward any received information to configurator 140, where the information is stored in storage 145.

The configurator 140 may receive information from one or more agents 120 a-b, and store the received information in storage 145. Configurator 140 may receive such information from an agent over a predetermined period of time. Configurator 140 may determine, based on the received information, whether any aspects of a container infrastructure or the extended application layer may be disabled or deinstalled. For example, agent 120 a may receive information representative of the interaction, such as communications, between application 112 a and the container infrastructure. The received information may correspond to one or more of the following: the activation of application 112 a inside the container infrastructure; the usage of application 112 a; and any other aspect of the extended application layer that may enable configurator 140 to determine what aspects of the container infrastructure or the extended application layer may be deinstalled (e.g., removed or disabled).

Configurator 140 may, based on the received information, initiate the implementation of the container infrastructure or the extended application layer without one or more aspects of the container infrastructure or the extended application layer. For example, configurator 140 initiates the de-installation of certain parts that are not used by application 112 a.

FIG. 2 depicts a process 200 for sizing in a system implementing virtualization technology, such as virtual machines 114 a-b.

At 210, an agent, such as agent 120 a, may receive information representative of the operation of the extended application layer, such as application 112 a, and the container infrastructure layer, such as container 122 a. For example, agent 120 a may monitor and receive communications between application 112 a and container 122 a. In some implementations, agent 120 a passively monitors communications between application 112 a and container 122 a, and then receives a copy of only those messages that may be relevant to processing at 220. Agent 120 a may provide the received information to configurator 140 through network 150, where that information is stored in storage 145.
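To illustrate, the passive monitoring at 210 may be sketched in Python as follows. This is only a minimal sketch under stated assumptions: the names Message, Storage, and Agent, the in-memory list standing in for storage 145, and the rule that a message is relevant only if it names a container aspect are all hypothetical and are not part of the disclosure.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Message:
    """One observed communication between an application and its container."""
    timestamp: float   # seconds since an arbitrary start (hypothetical unit)
    application: str   # e.g., "application-112a"
    aspect: str        # container aspect the message touched; "" if none


@dataclass
class Storage:
    """Hypothetical stand-in for storage 145: an in-memory list of records."""
    records: list = field(default_factory=list)


class Agent:
    """Passively observes messages and keeps copies of the relevant ones."""

    def __init__(self, storage: Storage):
        self.storage = storage

    def is_relevant(self, message: Message) -> bool:
        # Assumption: only messages that name an aspect matter at 220.
        return bool(message.aspect)

    def observe(self, message: Message) -> None:
        # The agent does not alter the message; it only stores a copy.
        if self.is_relevant(message):
            self.storage.records.append(message)


storage = Storage()
agent = Agent(storage)
for msg in [
    Message(0.0, "application-112a", "messaging-service"),
    Message(1.0, "application-112a", ""),  # no aspect named: not relevant
    Message(2.0, "application-112a", "log-service"),
]:
    agent.observe(msg)

print([m.aspect for m in storage.records])
# ['messaging-service', 'log-service']
```

In an actual deployment the agent would forward the records over a network rather than append to a local list; the local list is used here only to keep the sketch self-contained.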

At 220, configurator 140 uses the received information to determine what unnecessary aspects of the container infrastructure layer or the extended application layer may be deinstalled or disabled. Moreover, the received information may represent information gathered over a predetermined period of time. For example, the received information may correspond to messages sent between components of the extended application layer, such as application 112 a, and the container infrastructure, such as container 122 a, over a predetermined period of time, such as 1 minute, 1 hour, 1 day, 1 week, or any other period of time. If an aspect is not used during the period of time, that aspect of the extended application layer or container infrastructure may be identified for possible removal. In some cases, a set of predefined tests may be run to determine what aspects are unnecessary. When this is the case, time may not be a factor, so the predetermined period of time aspect is typically used only when there are no predefined tests.
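The time-window rule at 220 may be sketched in Python as follows. This is a minimal sketch, not a definitive implementation: the function name find_unused_aspects, the (timestamp, aspect) event format, and the example aspect names are assumptions made for illustration.

```python
def find_unused_aspects(installed, usage_events, window_start, window_end):
    """Return installed aspects with no usage event inside the window.

    installed: iterable of aspect names
    usage_events: iterable of (timestamp, aspect_name) pairs
    window_start, window_end: inclusive bounds of the observation period
    """
    used = {
        aspect
        for timestamp, aspect in usage_events
        if window_start <= timestamp <= window_end
    }
    return sorted(set(installed) - used)


installed = [
    "security-service",
    "messaging-service",
    "backup-service",
    "search-engine",
]
events = [
    (10.0, "security-service"),
    (20.0, "messaging-service"),
    (5000.0, "search-engine"),  # falls outside the window used below
]
candidates = find_unused_aspects(
    installed, events, window_start=0.0, window_end=3600.0
)
print(candidates)
# ['backup-service', 'search-engine']
```

The result is only a list of candidates for possible removal; a longer observation period reduces the chance of flagging an aspect that is used rarely but is still needed.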

To further illustrate, the received information stored in storage 145 may indicate that one or more of the following aspects may be removed as part of sizing: a software component, such as a business intelligence (BI) component or system, a search engine, a mobile engine support, and the like; one or more services in a container, such as the security service, the messaging service, the backup service, one or more log services, the monitoring service, the alerting service, and the like; in the case of ABAP, one or more work processes; in the case of Java, limiting the number and resource consumption of EJB (Enterprise Java Beans); certain modules/functions (e.g., encapsulated functionality within a computer program); database tables or other unused areas in a database; and main memory allocated by the extended application layer in the container infrastructure layer. As such, configurator 140 may determine which aspects may be deinstalled or disabled based on the received information.

At 230, configurator 140 may initiate the implementation (e.g., modification, re-installation, removal, and the like) of a container infrastructure and/or an extended application layer without the aspects identified in 220. In some implementations, configurator 140 may initiate the deinstallation of those aspects determined in 220. For example, configurator 140 may initiate deinstallation of those aspects determined in 220 or may initiate the reboot of the container infrastructure and the extended application layer so that the aspects determined in 220 are not installed (e.g., disabled, removed, and the like). Returning to the above example, configurator 140 may initiate the de-installation (or reduction) of the following aspects of the container infrastructure and the extended application layer: a software component, such as a business intelligence (BI) component or system, a search engine, a mobile engine support, and the like; one or more services in a container, such as the security service, the messaging service, the backup service, one or more log services, the monitoring service, the alerting service, and the like; in the case of ABAP, one or more work processes; in the case of Java, limiting the number and resource consumption of EJB (Enterprise Java Beans); certain modules/functions (e.g., encapsulated functionality within a computer program); database tables or other unused areas in a database; and main memory allocated by the extended application layer in the container infrastructure layer.
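The distinction at 230 between disabling an aspect and deinstalling it may be sketched in Python as follows. This is a minimal sketch under stated assumptions: the function apply_sizing, the representation of a configuration as a mapping from aspect name to an enabled flag, and the mode names are hypothetical.

```python
def apply_sizing(configuration, identified, mode="disable"):
    """Return a new configuration without the identified aspects.

    configuration: dict mapping aspect name -> True (enabled) or False
    identified: aspect names selected at 220
    mode: "disable" keeps the entry but turns it off;
          "deinstall" drops the entry entirely
    """
    if mode not in ("disable", "deinstall"):
        raise ValueError(f"unknown mode: {mode}")
    sized = dict(configuration)  # leave the caller's configuration untouched
    for aspect in identified:
        if aspect not in sized:
            continue  # nothing to do for aspects that are not installed
        if mode == "disable":
            sized[aspect] = False
        else:
            del sized[aspect]
    return sized


full = {"security-service": True, "backup-service": True, "search-engine": True}
disabled = apply_sizing(full, ["backup-service"], mode="disable")
removed = apply_sizing(full, ["backup-service", "search-engine"], mode="deinstall")
print(disabled)
# {'security-service': True, 'backup-service': False, 'search-engine': True}
print(removed)
# {'security-service': True}
```

Returning a new mapping rather than modifying the original leaves the full configuration available in case the sized configuration later proves too small.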

FIG. 3 depicts another process 300 for sizing a container infrastructure or extended application layer in a system implementing virtualization technology, such as virtual machines 114 a-b.

At 310, configurator 140 may initiate installation of a complete (i.e., full) container infrastructure and extended application layer at a virtual machine, such as virtual machines 114 a-b. For example, configurator 140 may install or initiate another program to install the complete extended application layer as well as the container infrastructure.

At 330, configurator 140 initiates installation of agents 120 a-b. Configurator 140 may install agents 120 a-b as part of the container infrastructure (e.g., a service in a J2EE engine), as part of the extended application layer, or as part of any other aspect of system 100. In some implementations, agents 120 a-b may be pre-installed (e.g., as a patch, upgrade, and the like) as part of the container infrastructure, the extended application layer, or a combination of both. In any case, configurator 140 may initiate installation by sending a message that initiates installation of an agent or activates a pre-installed agent.

At 340-360, configurator 140 may use a process similar to the one described above with respect to 210-230 at FIG. 2.

FIG. 4 depicts another process 400 for sizing an extended application layer and the underlying container infrastructure layer in a system implementing virtualization technology, such as virtual machines 114 a-b.

At 405, one or more tests are run on an extended application layer and/or a container.

In some implementations, the extended application layer and container infrastructure layer may be tested in a predetermined order. The predetermined order may test the extended application layer, container infrastructure layer, as well as other layers (e.g., operating systems) from smallest (in terms of, for example, size, number of components, and the like) to largest. Moreover, sets of preconfigured extended application layers and container infrastructure layers may be tested as well. To illustrate, if the smallest extended application layer and container infrastructure, such as the “absolute minimum” configuration, passes unit testing (i.e., it works with the application at the virtual machine), that absolute minimum configuration is used; otherwise, the next container infrastructure (e.g., the “middle-weight”) is tested and used if it passes testing; and so forth.
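The smallest-to-largest testing order may be sketched in Python as follows. This is a minimal sketch: the function select_first_passing, the numeric sizes, and the simulated test outcomes are assumptions made for illustration, and the callable passes_tests stands in for whatever test suite is actually run.

```python
def select_first_passing(configurations, passes_tests):
    """Test configurations from smallest to largest; return the first to pass.

    configurations: list of (name, size) pairs; size is any comparable measure
    passes_tests: callable taking a configuration name, returning True or False
    """
    for name, _size in sorted(configurations, key=lambda c: c[1]):
        if passes_tests(name):
            return name  # stop early: larger configurations are not tested
    return None  # no configuration passed


configs = [("full", 900), ("absolute-minimum", 120), ("middle-weight", 400)]
# Hypothetical outcome: the minimum lacks a service the application needs.
outcomes = {"absolute-minimum": False, "middle-weight": True, "full": True}
chosen = select_first_passing(configs, lambda name: outcomes[name])
print(chosen)
# middle-weight
```

Because testing stops at the first configuration that passes, the full configuration is never tested in this example.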

At 440, configurator 140 may receive the test information. The test information may represent operation of the extended application layer and the container infrastructure layer tested. For example, if testing determines that each of the container infrastructures operated properly (e.g., passed its tests) with the application layer, configurator 140 may then select the so-called “absolute minimum” container infrastructure since this represents the minimum that works properly with the application.
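The selection at 440, where results for several tested configurations are already available, may be sketched in Python as follows. This is a minimal sketch: the function select_minimum_passing and the record fields name, size, and passed are hypothetical.

```python
def select_minimum_passing(test_results):
    """Return the name of the smallest configuration that passed, or None.

    test_results: list of dicts with keys "name", "size", and "passed"
    """
    passing = [result for result in test_results if result["passed"]]
    if not passing:
        return None
    return min(passing, key=lambda result: result["size"])["name"]


received = [
    {"name": "full", "size": 900, "passed": True},
    {"name": "middle-weight", "size": 400, "passed": True},
    {"name": "absolute-minimum", "size": 120, "passed": True},
]
selected = select_minimum_passing(received)
print(selected)
# absolute-minimum
```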

At 460, configurator 140 may initiate implementation (e.g., modify, install, re-install, and the like) of the extended application layer and the container infrastructure based on the results of 450, as described above with respect to 360.

Although FIG. 1 depicts system 100, other configurations of the components of system 100 may be implemented as well. For example, any number of hosts, virtual machines, agents, and applications may be implemented. Moreover, although FIG. 1 depicts a single virtual machine on each host, a plurality of virtual machines may be implemented on each host. Moreover, in some implementations, configurator 140 is incorporated into virtualization software, such as software used to manage one or more virtual machines.

Although the above example describes receiving information regarding the container and the extended application layer, other information, such as operating system information, may be received as well, and aspects of the operating system may be sized in accordance with the mechanisms described herein. Moreover, the term user refers to any entity including another processor, such as a host computer.

The systems and methods disclosed herein may be embodied in various forms including, for example, a data processor, such as a computer that also includes a database, digital electronic circuitry, firmware, software, or in combinations of them. Moreover, the above-noted features and other aspects and principles of the present disclosed embodiments may be implemented in various environments. Such environments and related applications may be specially constructed for performing the various processes and operations according to the disclosed embodiments or they may include a general-purpose computer or computing platform selectively activated or reconfigured by code to provide the necessary functionality. The processes disclosed herein are not inherently related to any particular computer, network, architecture, environment, or other apparatus, and may be implemented by a suitable combination of hardware, software, and/or firmware. For example, various general-purpose machines may be used with programs written in accordance with teachings of the disclosed embodiments, or it may be more convenient to construct a specialized apparatus or system to perform the required methods and techniques.

The systems and methods disclosed herein may be implemented as a computer program product, i.e., a computer program tangibly embodied in an information carrier, e.g., in a machine readable storage device or in a propagated signal, for execution by, or to control the operation of, data processing apparatus, e.g., a programmable processor, a computer, or multiple computers. A computer program can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program can be deployed to be executed on one computer or on multiple computers at one site or distributed across multiple sites and interconnected by a communication network.

The foregoing description is intended to illustrate but not to limit the scope of the invention, which is defined by the scope of the appended claims. Other embodiments are within the scope of the following claims. 

1. A computer-readable medium containing instructions to configure a processor to perform a method, the method comprising: receiving information representative of at least one of an extended application layer and a container infrastructure layer, the extended application layer and the container infrastructure layer operating at a virtual machine at a host; determining, based on the received information, whether one or more aspects of the at least one of the extended application layer and the container infrastructure layer are to be deinstalled or disabled; and implementing the at least one of the extended application layer and the container infrastructure layer based on the results of the determining.
 2. The computer-readable medium of claim 1, wherein receiving further comprises: receiving information using an agent for monitoring the extended application layer and the container infrastructure layer.
 3. The computer-readable medium of claim 2 further comprising: providing the received information to a configurator for determining, based on the received information, whether the one or more aspects of the extended application layer and the container infrastructure layer may be deinstalled or disabled from the extended application layer and the container infrastructure layer.
 4. The computer-readable medium of claim 1, wherein determining further comprises: determining at least one of the one or more aspects as a component of the extended application layer or a component of the container infrastructure layer that is not being used by a user of the extended application layer at the virtual machine at the host, the host comprising one or more of the following: a node, a computer, a processor, a server, and a blade.
 5. The computer-readable medium of claim 1, wherein implementing further comprises: implementing the extended application layer and the container infrastructure layer by deinstalling at least one of the one or more aspects, when at least one aspect is not used by a user of the extended application layer.
 6. The computer-readable medium of claim 1 further comprising: initiating installation of the container infrastructure at the virtual machine; initiating installation of the extended application layer at the virtual machine; and initiating installation of the agent, the agent installed as part of at least one of the extended application layer or the container infrastructure layer.
 7. The computer-readable medium of claim 1 further comprising: testing at the virtual machine preconfigured sets of extended application layers and container infrastructures.
 8. The computer-readable medium of claim 7 further comprising: implementing at least one of the sets based on the testing.
 9. The computer-readable medium of claim 7 further comprising: testing the at least one of the sets using a predetermined order for each of the combinations.
 10. The computer-readable medium of claim 1, further comprising: implementing the virtual machine as a virtual appliance.
 11. A computer-implemented method comprising: receiving information representative of at least one of an extended application layer and a container infrastructure layer, the extended application layer and the container infrastructure layer operating at a virtual machine at a host; determining, based on the received information, whether one or more aspects of the at least one of the extended application layer and the container infrastructure layer are to be deinstalled or disabled; and implementing the at least one of the extended application layer and the container infrastructure layer based on the results of the determining.
 12. The computer-implemented method of claim 11, wherein receiving further comprises: receiving information using an agent for monitoring the extended application layer and the container infrastructure layer.
 13. The computer-implemented method of claim 12 further comprising: providing the received information to a configurator for determining, based on the received information, whether the one or more aspects of the extended application layer and the container infrastructure layer may be deinstalled or disabled from the extended application layer and the container infrastructure layer.
 14. The computer-implemented method of claim 11, wherein determining further comprises: determining at least one of the one or more aspects as a component of the extended application layer or a component of the container infrastructure layer that is not being used by a user of the extended application layer at the virtual machine at the host, the host comprising one or more of the following: a node, a computer, a processor, a server, and a blade.
 15. The computer-implemented method of claim 11, wherein implementing further comprises: implementing the extended application layer and the container infrastructure layer by deinstalling at least one of the one or more aspects, when at least one aspect is not used by a user of the extended application layer.
 16. The computer-implemented method of claim 11 further comprising: initiating installation of the container infrastructure at the virtual machine; initiating installation of the extended application layer at the virtual machine; and initiating installation of the agent, the agent installed as part of at least one of the extended application layer or the container infrastructure layer.
 17. The computer-implemented method of claim 11 further comprising: testing at the virtual machine preconfigured sets of extended application layers and container infrastructures.
 18. The computer-implemented method of claim 17 further comprising: implementing at least one of the sets based on the testing.
 19. The computer-implemented method of claim 17 further comprising: testing the at least one of the sets using a predetermined order for each of the combinations.
 20. A system comprising: a processor; and a memory, wherein the processor and the memory are configured to perform a method comprising: receiving information representative of at least one of an extended application layer and a container infrastructure layer, the extended application layer and the container infrastructure layer operating at a virtual machine at a host; determining, based on the received information, whether one or more aspects of the at least one of the extended application layer and the container infrastructure layer are to be deinstalled or disabled; and implementing the at least one of the extended application layer and the container infrastructure layer based on the results of the determining.